RAW SEQUENCE LISTING

PATENT APPLICATION US/09/873,224 



DATE: 01/11/2002 
TIME: 22:32:18 



INPUT SET: S36717.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages.
SEQUENCE LISTING

(1) General Information: 

(i) APPLICANT: 

(A) NAME: Innogenetics sa. 

(B) STREET: Industriepark Zwijnaarde 7, box 4

(C) CITY: Ghent 

(E) COUNTRY: Belgium 

(F) POSTAL CODE (ZIP): B-9052 

(G) TELEPHONE: 00 32 9 241 07 11 

(H) TELEFAX: 00 32 9 241 07 99 

(ii) TITLE OF INVENTION: New sequences of hepatitis C virus 
genotypes for diagnosis, prophylaxis and therapy.

(iii) NUMBER OF SEQUENCES: 270

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/873,224 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/362,455

(B) FILING DATE: 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 




(iii) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:

(B) CLONE: BR34-4-20

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..213

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CTC ACG GAA CGG CTT TAC TGC GGG GGC CCT ATG TTC AAC AGC AAG GGG 48
Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly
1 5 10 15

GCC CAG TGT GGT TAT CGC CGC TGC CGT GCC AGT GGA GTT CTG CCT ACC 96
Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr
20 25 30

AGC TTC GGC AAC ACA ATC ACT TGC TAC ATC AAG GCC ACA GCG GCT GCA 144
Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala
35 40 45

AGG GCC GCA GGC CTC CGG AAC CCG GAC TTT CTT GTC TGC GGA GAT GAT 192
Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp
50 55 60

CTG GTC GTG GTG GCT GAG AGT 213
Leu Val Val Val Ala Glu Ser
65 70
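The CDS feature at location 1..213 means the 213 bases of SEQ ID NO: 1 read in frame as 71 codons. As a rough cross-check only, the sketch below translates the nucleotide sequence so the result can be compared with the residues printed beneath it. It assumes the standard genetic code, which the listing does not name; the names CODON_TABLE, SEQ_ID_NO_1 and translate are illustrative and not part of the filing.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption of this sketch: the listing does not state a translation table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# SEQ ID NO: 1 copied codon by codon from the listing; whitespace is removed below.
SEQ_ID_NO_1 = "".join("""
CTC ACG GAA CGG CTT TAC TGC GGG GGC CCT ATG TTC AAC AGC AAG GGG
GCC CAG TGT GGT TAT CGC CGC TGC CGT GCC AGT GGA GTT CTG CCT ACC
AGC TTC GGC AAC ACA ATC ACT TGC TAC ATC AAG GCC ACA GCG GCT GCA
AGG GCC GCA GGC CTC CGG AAC CCG GAC TTT CTT GTC TGC GGA GAT GAT
CTG GTC GTG GTG GCT GAG AGT
""".split())


def translate(dna: str) -> str:
    """Translate a coding sequence read in frame from base 1 into one-letter residues."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


protein = translate(SEQ_ID_NO_1)
print(len(SEQ_ID_NO_1), len(protein))
print(protein)
```

The one-letter output can be compared residue by residue with the three-letter protein printed under the codons and with the 71 amino acids reported for SEQ ID NO: 2.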
80 

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 71 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly
1 5 10 15

Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr
20 25 30

Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala
35 40 45




Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp
50 55 60

Leu Val Val Val Ala Glu Ser
65 70

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 213 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:

(B) CLONE: BR36-23-18

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..213

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

[illegible] CTT [illegible] TGC GGG GGC CCT ATG TTC [illegible] AGC AAG GGG 48
Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly
1 5 10 15

[illegible] TGT GGT TAT CGC CGC TGC CGT GCC AGT GGA GTT CTG CCT ACC 96
Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr
20 25 30

[illegible] TAC ATC [illegible] GCC ACA GGG GCT GCA 144
Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala
35 40 45

[illegible] GCA G?C CTC CGG [illegible] CCG GAC TTT CTT GTC TGC GGA GAT GAT 192
Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp
50 55 60

CTG GTC GTG GTG GCT GAG AGT 213
Leu Val Val Val Ala Glu Ser
65 70
148 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 71 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly
1 5 10 15

Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr
20 25 30

Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala
35 40 45

Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp
50 55 60

Leu Val Val Val Ala Glu Ser
65 70
174 

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 213 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:

(B) CLONE: BR36-23-18

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..213

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CTC ACG GAG CGG CTT TAC TGC GGG GGC CCT ATG TTT AAC [illegible] 48
Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly
1 5 10 15

[illegible] TAT CGC CGT TGC CGT GCC AGT GGA GTT CTG CCT ACC 96
Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr
20 25 30




AGC TTC GGC AAC ACA ATC ACT TGT TAC ATC AAA GCC ACA GCG GCC GCA 144 
Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala
35 40 45 

AAA GCC GCA GGC CTC CGG AGC CCG GAC TTT CTT GTC TGC GGA GAT GAT 192 
Lys Ala Ala Gly Leu Arg Ser Pro Asp Phe Leu Val Cys Gly Asp Asp 
50 55 60 



CTG GTC GTG GTG GCT GAG AGT 
Leu Val Val Val Ala Glu Ser 
65 70 



(2) INFORMATION FOR SEQ ID NO: 6:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly
1 5 10 15

Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr
20 25 30

Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala
35 40 45 

Lys Ala Ala Gly Leu Arg Ser Pro Asp Phe Leu Val Cys Gly Asp Asp 
50 55 60 

Leu Val Val Val Ala Glu Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO: 7:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO 



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 01/11/2002

PATENT APPLICATION US/09/873,224 TIME: 22:32:2[?]



INPUT SET: S36717.raw 



Line  Error                            Original Text

5     Mandatory Value Not Present      (i) APPLICANT:
6     Unknown or Misplaced Identifier  (A) NAME: Innogenetics sa.
7     Unknown or Misplaced Identifier  (B) STREET: Industriepark Zwijnaarde 7, box 4
8     Unknown or Misplaced Identifier  (C) CITY: Ghent
9     Unknown or Misplaced Identifier  (E) COUNTRY: Belgium
10    Unknown or Misplaced Identifier  (F) POSTAL CODE (ZIP): B-9052
11    Unknown or Misplaced Identifier  (G) TELEPHONE: 00 32 9 241 07 11
12    Unknown or Misplaced Identifier  (H) TELEFAX: 00 32 9 241 07 99





PAGE: 1 




DATE: 01/11/2002 
TIME: 22:32:20 



INPUT SET: S36717.raw 



ADDRESSEE 

STREET 

CITY 

STATE 

COUNTRY 

ZIP 

CORRESPONDENCE ADDRESS 



PAGE: 1 SEQUENCE CORRECTION REPORT DATE: 01/11/2002

PATENT APPLICATION US/09/873,224 TIME: 22:32:20 

INPUT SET: S36717.raw 



Line  Original Text                             Corrected Text

4037  (C) INDIVIDUAL ISOLATE: BR33 and BR36     (C) INDIVIDUAL ISOLATE: BR33 and BR36
4061  (C) INDIVIDUAL ISOLATE: HD10              (C) INDIVIDUAL ISOLATE: HD10
4086  (C) INDIVIDUAL ISOLATE: BR36              (C) INDIVIDUAL ISOLATE: BR36
4110  (C) INDIVIDUAL ISOLATE: BR36              (C) INDIVIDUAL ISOLATE: BR36
4137  (C) INDIVIDUAL ISOLATE: HD10              (C) INDIVIDUAL ISOLATE: HD10
4188  (C) INDIVIDUAL ISOLATE: BR36              (C) INDIVIDUAL ISOLATE: BR36
4215  (C) INDIVIDUAL ISOLATE: HD10              (C) INDIVIDUAL ISOLATE: HD10
5305  (C) INDIVIDUAL ISOLATE: GB358             (C) INDIVIDUAL ISOLATE: GB358
5327  (C) INDIVIDUAL ISOLATE: GB549             (C) INDIVIDUAL ISOLATE: GB549
5349  (C) INDIVIDUAL ISOLATE: GB809             (C) INDIVIDUAL ISOLATE: GB809
5370  (C) INDIVIDUAL ISOLATE: GB358             (C) INDIVIDUAL ISOLATE: GB358
5393  (C) INDIVIDUAL ISOLATE: GB549             (C) INDIVIDUAL ISOLATE: GB549
5414  (C) INDIVIDUAL ISOLATE: GB809             (C) INDIVIDUAL ISOLATE: GB809
5436  (C) INDIVIDUAL ISOLATE: GB358             (C) INDIVIDUAL ISOLATE: GB358
5459  (C) INDIVIDUAL ISOLATE: GB549             (C) INDIVIDUAL ISOLATE: GB549
5481  (C) INDIVIDUAL ISOLATE: GB809             (C) INDIVIDUAL ISOLATE: GB809
5503  (C) INDIVIDUAL ISOLATE: GB358             (C) INDIVIDUAL ISOLATE: GB358
5524  (C) INDIVIDUAL ISOLATE: GB549             (C) INDIVIDUAL ISOLATE: GB549
5545  (C) INDIVIDUAL ISOLATE: GB809             (C) INDIVIDUAL ISOLATE: GB809
5567  (C) INDIVIDUAL ISOLATE: GB358 and GB809   (C) INDIVIDUAL ISOLATE: GB358 and GB809
5590  (C) INDIVIDUAL ISOLATE: GB549             (C) INDIVIDUAL ISOLATE: GB549
5612  (C) INDIVIDUAL ISOLATE: GB549             (C) INDIVIDUAL ISOLATE: GB549



